Abstract

We show that, for any d, all but a doubly exponentially small fraction of decision trees
of depth at most d require $\Omega(d)$ quantum queries to be computed with bounded error. In
other words, most efficient classical algorithms in the query complexity model do not admit a
significant quantum speed-up. The proof is based on showing that, with high probability, the
average sensitivity of a random decision tree is high.

1 Introduction

Many of the most important examples of quantum algorithms which outperform classical algorithms
operate in the query complexity model, where the quantity of interest is the number of queries to the
input required to compute some function [4]. Can most classical query algorithms be significantly
accelerated by a quantum computer? In one sense, the answer is a resounding no. It was shown by
Ambainis in 1999 that almost all boolean functions on n bits require $n/4 - O(\sqrt{n} \log n)$ quantum
queries to be computed with bounded error [2]; using different techniques, Ambainis et al. have very
recently improved this bound to $n/2 - o(n)$ [3].

However, random boolean functions are arguably somewhat uninteresting in general in that 
they are hard both for classical and quantum algorithms to compute. One can interpret the result 
of [2] as saying that there are too many boolean functions, and too few efficient algorithms, whether 
classical or quantum. Perhaps a more interesting question is whether most functions with efficient 
classical algorithms in the query model - i.e. short decision trees - could be computed significantly 
more quickly on a quantum computer than is possible classically. In this note we show that this is 
also not the case. 

We first need to define the model of decision trees which we will use (see [4] for further
background). A (boolean) decision tree T is a rooted binary tree where each internal vertex has exactly
two children. Such trees are called full; only full binary trees will be considered in this paper. Each
internal vertex is labelled with a variable $x_i$, $1 \le i \le n$, and each leaf is labelled with 0 or 1,
corresponding to the output of the tree. Each edge from a vertex labelled with $x_i$ to its children is
labelled with 0 or 1, corresponding to the value of $x_i$. T computes a boolean function
$T : \{0,1\}^n \to \{0,1\}$ in the obvious way: starting with the root, the variable labelling each vertex
is queried, and dependent on whether the answer is 0 or 1 the left or right subtree is evaluated. When
a leaf is reached, the tree outputs the label of that leaf. The depth of a vertex is defined as follows:
the root has depth 0, and the depth of any other vertex is equal to the depth of its parent, plus 1.
The depth of T is the largest depth of any vertex, i.e. the worst-case number of queries made on any input.
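The definition above can be made concrete with a small data structure. The following is only an illustrative sketch: the class names `Leaf` and `Node`, the helper functions, and the 0-based variable indexing are assumptions of this example and do not appear in the paper.

```python
from dataclasses import dataclass
from typing import Sequence, Union


@dataclass(frozen=True)
class Leaf:
    value: int  # output bit, 0 or 1


@dataclass(frozen=True)
class Node:
    var: int      # 0-based index of the queried variable (hypothetical convention)
    zero: "Tree"  # subtree followed when the queried bit is 0
    one: "Tree"   # subtree followed when the queried bit is 1


Tree = Union[Leaf, Node]


def evaluate(tree: Tree, x: Sequence[int]) -> int:
    """Follow the path determined by x from the root down to a leaf."""
    while isinstance(tree, Node):
        tree = tree.one if x[tree.var] else tree.zero
    return tree.value


def depth(tree: Tree) -> int:
    """Largest depth of any vertex, i.e. the worst-case number of queries."""
    if isinstance(tree, Leaf):
        return 0
    return 1 + max(depth(tree.zero), depth(tree.one))


# Example: the AND of two bits as a full binary tree of depth 2.
and_tree = Node(0, Leaf(0), Node(1, Leaf(0), Leaf(1)))
```

In this example the left branch stops after one query while the right branch makes two, so the depth (worst case) is 2 even though some inputs need only one query.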

Following [7], we will consider the uniform model of random decision trees. In this model, we 
obtain a random depth d decision tree T on $n \ge d$ variables simply by picking T uniformly at
random from the set of all non-redundant decision trees of depth at most d on n variables. A 
decision tree is said to be non-redundant if no variable occurs more than once on any path from 
the root to a leaf. Observe that choosing a boolean function by picking a tree uniformly at random 
does not pick the corresponding function itself uniformly at random from the set of all functions 
with decision trees of depth d, as many different decision trees can represent the same function, 
and some functions may have more trees representing them than others. Also observe that the 
internal structure of a uniformly random decision tree T and the values assigned to its leaves are 
independent. In other words, a random decision tree T of depth at most d on n variables can be 
obtained by a two-stage process: first, pick the structure and labels of the internal vertices of T 
uniformly at random from the set of all decision trees of depth at most d on n variables, then assign 
0 or 1 to the leaves of T uniformly at random.

One could also consider an alternative model of random decision trees, known as the complete 
model [7]. In this model, T's structure is always that of a complete binary tree of depth d. We 
randomly assign variables to T's internal vertices, consistent with T being non-redundant, and 
assign 0 or 1 to T's leaves uniformly at random. This model appears somewhat more artificial
and we therefore concentrate on the uniform model here; however, our results easily extend to the 
complete model. 
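As an illustration of the complete model just described, here is a minimal sampling sketch. The tuple encoding, the function names and the choice to draw each variable uniformly from those not yet used on the current path are assumptions of this example; the last is one natural reading of "consistent with T being non-redundant".

```python
import random


def random_complete_tree(n, d, rng, used=()):
    """Sample a tree in the complete model.

    Encoding (an assumption of this sketch): a leaf is ("leaf", bit) and an
    internal vertex is ("node", var, zero_subtree, one_subtree).
    """
    if d > n - len(used):
        raise ValueError("need at least d unused variables to stay non-redundant")
    if d == 0:
        return ("leaf", rng.randrange(2))
    # Pick uniformly among the variables not yet queried on this path.
    var = rng.choice([i for i in range(n) if i not in used])
    path = used + (var,)
    return ("node", var,
            random_complete_tree(n, d - 1, rng, path),
            random_complete_tree(n, d - 1, rng, path))


def leaf_depths(tree, k=0):
    """Depths of all leaves, left to right."""
    if tree[0] == "leaf":
        return [k]
    return leaf_depths(tree[2], k + 1) + leaf_depths(tree[3], k + 1)


def is_non_redundant(tree, seen=frozenset()):
    """True if no variable repeats on any root-to-leaf path."""
    if tree[0] == "leaf":
        return True
    _, var, zero, one = tree
    if var in seen:
        return False
    return is_non_redundant(zero, seen | {var}) and is_non_redundant(one, seen | {var})


tree = random_complete_tree(n=6, d=4, rng=random.Random(0))
```

Every leaf of the sampled tree sits at depth exactly d, which is what distinguishes this model from the uniform model, where leaves may occur at any depth up to d.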

We can now state our main result. 

Theorem 1. Let T be a random depth d decision tree on n variables. Then there exists a universal
constant $\alpha > 0$ such that $\Pr_T[Q_2(T) < \alpha d] \le 2^{-\Omega(2^{d/3})}$.

In the above theorem, $Q_2(f)$ is the bounded-error quantum query complexity of f [4], i.e. the
smallest number of quantum queries required to compute f with worst-case success probability at
least 2/3. Observe that the concentration bound obtained is doubly exponentially small in d. This 
result thus implies that the vast majority of efficient classical algorithms in the query complexity 
model do not admit a significant quantum speed-up. 

The counting technique used by Ambainis [1] to prove that most boolean functions do not have 
query-efficient quantum algorithms does not seem to suffice to prove Theorem 1. Indeed, Ambainis 
showed that there are at most 2°( n + ' d ^ boolean functions on n bits that can be computed by 
quantum algorithms making d queries. However, there can be n 0( - 2 ' decision trees of depth d. 
Thus this argument does not disallow, for example, the possibility that random decision trees of 
depth $d = O(\log^2 n)$ could be evaluated using $O(\log n)$ quantum queries.

We therefore take another approach to proving the above theorem, by giving a lower bound on 
the average sensitivity (also known as total influence) of random decision trees. This quantity is 
mathematically tractable and is known to lower bound quantum query complexity [10]. 

2 Proof of Theorem 1 

The average sensitivity of a boolean function $f : \{0,1\}^n \to \{0,1\}$ is defined as

$$ s(f) = \frac{1}{2^n} \sum_{x \in \{0,1\}^n} \sum_{i=1}^{n} |f(x) - f(x^i)|, $$

where $x^i$ is the n-bit string obtained by flipping the i'th bit of x. Up to normalisation, s(f) thus
counts the number of neighbours x, y such that $f(x) \ne f(y)$. Average sensitivity gives the following
lower bound on quantum query complexity.
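For small n the definition of s(f) can be evaluated directly by enumerating all inputs. The sketch below does exactly that; the function name and the two example functions are illustrative choices, not taken from the paper.

```python
from itertools import product


def average_sensitivity(f, n):
    """s(f) = 2^{-n} * sum over x and i of |f(x) - f(x^i)|, by brute force."""
    total = 0
    for x in product((0, 1), repeat=n):
        fx = f(x)
        for i in range(n):
            flipped = x[:i] + (1 - x[i],) + x[i + 1:]  # x with the i'th bit flipped
            total += abs(fx - f(flipped))
    return total / 2 ** n


def parity(x):
    return sum(x) % 2


def and_fn(x):
    return int(all(x))


s_parity = average_sensitivity(parity, 3)  # every flip changes parity
s_and = average_sensitivity(and_fn, 3)     # only inputs near 111 are sensitive
```

Parity is sensitive to every bit on every input, so it attains the maximum value n, whereas AND is sensitive only near the all-ones input and has average sensitivity well below 1.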

Theorem 2 (Shi [10], see [11] for the version here). Assume that $f : \{0,1\}^n \to \{0,1\}$ can be
computed by a quantum algorithm making q queries and with worst-case failure probability $\epsilon < 1/2$.
Then $q \ge \frac{1}{2}(1 - 2\epsilon)^2 s(f)$.

Our goal will now be to upper bound the probability that the average sensitivity of a random 
decision tree is low. To do so, we will use the following powerful measure concentration result. 

Theorem 3 (See e.g. [9] or [6, Corollary 5.2]). Fix $\eta > 0$ and assume that $g : \{0,1\}^n \to \mathbb{R}$ satisfies
$|g(x) - g(y)| \le \eta$ for all $x, y \in \{0,1\}^n$ such that d(x, y) = 1, where d(x, y) is the Hamming distance
between x and y. Let $x \in \{0,1\}^n$ be picked uniformly at random. Then

$$ \Pr_x\left[ g(x) < \mathbb{E}_x[g(x)] - \delta \right] \le e^{-2\delta^2 / (n \eta^2)}. $$
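As a sanity check of the shape of this bound, one can compare it with an exactly computable case. The sketch below takes g to be the Hamming weight, so that $\eta = 1$ and the expectation is n/2; this choice of g and the function names are assumptions of the example, not something used in the paper.

```python
from math import comb, exp


def lower_tail(n, delta):
    """Exact Pr[|x| < n/2 - delta] for uniform x in {0,1}^n (|x| = Hamming weight)."""
    favourable = sum(comb(n, k) for k in range(n + 1) if k < n / 2 - delta)
    return favourable / 2 ** n


def concentration_bound(n, delta, eta=1.0):
    """Right-hand side of the bound: exp(-2 delta^2 / (n eta^2))."""
    return exp(-2 * delta ** 2 / (n * eta ** 2))


# Compare exact tail and bound on a small grid of parameters.
checks = [(n, delta, lower_tail(n, delta), concentration_bound(n, delta))
          for n in (10, 50, 200) for delta in (1, 3, 5, 10)]
```

For this particular g the inequality reduces to a Hoeffding-type bound on a binomial tail, so the exact tail should never exceed the bound for any entry in `checks`.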

We first calculate the expected average sensitivity of random decision trees. As discussed 
previously, we will choose a uniformly random decision tree by first choosing the structure of the 
tree, then choosing assignments to the leaves uniformly at random. So fix a decision tree T on n 
variables and let $\mathcal{L}$ be the set of leaves of T, setting $L := |\mathcal{L}|$. Let $T_z : \{0,1\}^n \to \{0,1\}$ be the
boolean function obtained by assigning $z \in \{0,1\}^L$ to the leaves of T; we will eventually apply
Theorem 3 to the function $g : \{0,1\}^L \to \mathbb{R}$ defined by $g(z) = s(T_z)$. For any leaf $\ell$, let $d(\ell)$ be the
depth of $\ell$.

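Lemma 4.

$$ \mathbb{E}_{z \in \{0,1\}^L}[s(T_z)] = \frac{1}{2} \sum_{\ell \in \mathcal{L}} d(\ell)\, 2^{-d(\ell)}. $$

Proof. From the definition of g,

$$ \mathbb{E}_{z \in \{0,1\}^L}[g(z)] = \frac{1}{2^n} \sum_{x \in \{0,1\}^n} \sum_{i=1}^{n} \mathbb{E}_{z \in \{0,1\}^L}\left[ |T_z(x) - T_z(x^i)| \right]. $$

Now observe that for each i, if T queries the i'th bit on input x, $\mathbb{E}_{z \in \{0,1\}^L}[|T_z(x) - T_z(x^i)|] = 1/2$.
Otherwise, $\mathbb{E}_{z \in \{0,1\}^L}[|T_z(x) - T_z(x^i)|] = 0$. So we obtain

$$ \mathbb{E}_{z \in \{0,1\}^L}[s(T_z)] = \frac{1}{2^{n+1}} \sum_{x \in \{0,1\}^n} |\{i : T \text{ queries } i \text{ on } x\}| = \frac{1}{2}\, \mathbb{E}_{x \in \{0,1\}^n}[\text{path length on input } x]. $$

As each query to the input gives 0 or 1 with equal probability on a random input x, the probability
of ending up at a given leaf $\ell$ on a random input is just $2^{-d(\ell)}$. The claim follows. □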
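The identity in Lemma 4 can be checked by brute force on a small tree: average $s(T_z)$ over all leaf assignments z and compare with half the expected path length. The sketch below does this for one fixed non-redundant tree; the tuple encoding, the particular tree `STRUCTURE` and all function names are hypothetical choices made for this example.

```python
from itertools import product

# Hypothetical encoding: a leaf is None, an internal vertex is
# (var, zero_subtree, one_subtree) with 0-based variable indices.
STRUCTURE = (0, None, (1, (2, None, None), None))  # non-redundant, n = 3
N_VARS = 3


def leaf_depths(t, k=0):
    """Depths of the leaves in left-to-right order."""
    if t is None:
        return [k]
    return leaf_depths(t[1], k + 1) + leaf_depths(t[2], k + 1)


def leaf_index(t, x):
    """Left-to-right index of the leaf reached on input x."""
    idx = 0
    while t is not None:
        var, zero, one = t
        if x[var]:
            idx += len(leaf_depths(zero))  # skip every leaf of the 0-subtree
            t = one
        else:
            t = zero
    return idx


def avg_sensitivity(z, t, n):
    """s(T_z) for the leaf assignment z, by enumerating all inputs."""
    total = 0
    for x in product((0, 1), repeat=n):
        for i in range(n):
            y = x[:i] + (1 - x[i],) + x[i + 1:]
            total += abs(z[leaf_index(t, x)] - z[leaf_index(t, y)])
    return total / 2 ** n


def expected_sensitivity(t, n):
    """Average of s(T_z) over all 2^L leaf assignments z."""
    num_leaves = len(leaf_depths(t))
    values = [avg_sensitivity(z, t, n) for z in product((0, 1), repeat=num_leaves)]
    return sum(values) / len(values)


def lemma4_rhs(t):
    """(1/2) * sum over leaves of d(l) * 2^{-d(l)}."""
    return 0.5 * sum(d * 2.0 ** -d for d in leaf_depths(t))


lhs = expected_sensitivity(STRUCTURE, N_VARS)
rhs = lemma4_rhs(STRUCTURE)
```

For this tree the leaf depths are 1, 3, 3 and 2, so the expected path length is 1.75 and both sides should equal 0.875.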

We can also bound $\eta = \max_{w,z : d(w,z) = 1} |s(T_w) - s(T_z)|$ as follows.

Lemma 5. Let $w, z \in \{0,1\}^L$ satisfy d(w, z) = 1. Then

$$ |s(T_w) - s(T_z)| \le \max_{\ell \in \mathcal{L}} d(\ell)\, 2^{1 - d(\ell)}. $$






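Proof. The idea is essentially the same as the proof of Lemma 4. We have

$$ s(T_w) - s(T_z) = \frac{1}{2^n} \sum_{x \in \{0,1\}^n} \sum_{i=1}^{n} \left( |T_w(x) - T_w(x^i)| - |T_z(x) - T_z(x^i)| \right). $$

As w and z only differ on one leaf $\ell$, $|T_w(x) - T_w(x^i)| = |T_z(x) - T_z(x^i)|$ unless precisely one of x
and $x^i$ leads to $\ell$. Thus

$$ |s(T_w) - s(T_z)| \le \frac{1}{2^n} \sum_{x : x \to \ell}\ \sum_{i : x^i \not\to \ell} 1 + \frac{1}{2^n} \sum_{x : x \not\to \ell}\ \sum_{i : x^i \to \ell} 1 = \frac{1}{2^{n-1}} \sum_{x : x \to \ell}\ \sum_{i : x^i \not\to \ell} 1, $$

where we use the notation $x \to \ell$ and $x \not\to \ell$ to mean that evaluation of the tree does or does not
end up in leaf $\ell$ on input x, respectively. For each x, there can only be at most $d(\ell)$ variables i
such that $x \to \ell$ and $x^i \not\to \ell$. Therefore

$$ |s(T_w) - s(T_z)| \le 2 d(\ell) \Pr_{x \in \{0,1\}^n}[x \to \ell] = d(\ell)\, 2^{1 - d(\ell)}. $$

□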

Note that this bound is quite weak when there exists a leaf $\ell$ in T of low depth. Nevertheless,
its dependence on $d(\ell)$ is essentially tight: consider the tree T that computes

$$ T(x) = a x_1 + (1 - x_1) x_2 x_3 \cdots x_n, $$

for $a \in \{0,1\}$. Then changing a from 0 to 1 changes s(T) from close to 0 to close to 1. Luckily, it
turns out that in fact $\min_{\ell \in \mathcal{L}} d(\ell)$ will be high with high probability; roughly speaking, a random
tree is unlikely to have any low-hanging fruit. We formalise this as the following lemma, which we
prove at the end.
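The tightness example, in which a single leaf of depth 1 is labelled a and a chain of queries to the remaining variables hangs off the other branch, can be evaluated numerically for a moderate n. The sketch below uses n = 10 and 0-based indices, both arbitrary choices for illustration, and the function names are hypothetical.

```python
from itertools import product


def average_sensitivity(f, n):
    """s(f) = 2^{-n} * sum over x and i of |f(x) - f(x^i)|, by brute force."""
    total = 0
    for x in product((0, 1), repeat=n):
        fx = f(x)
        for i in range(n):
            flipped = x[:i] + (1 - x[i],) + x[i + 1:]
            total += abs(fx - f(flipped))
    return total / 2 ** n


def example_tree(a):
    """T(x) = a*x_1 + (1 - x_1)*x_2*...*x_n, written with 0-based indices."""
    def f(x):
        return a * x[0] + (1 - x[0]) * int(all(x[1:]))
    return f


n = 10
s_a0 = average_sensitivity(example_tree(0), n)  # leaf at depth 1 labelled 0
s_a1 = average_sensitivity(example_tree(1), n)  # same leaf relabelled to 1
```

Changing the single depth-1 leaf moves the average sensitivity by almost exactly 1, which matches the Lemma 5 bound $d(\ell) 2^{1-d(\ell)} = 1$ for a leaf at depth 1.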

Lemma 6. Let T be picked uniformly at random from the set of decision trees of depth at most d.
Then, for any $h \le d - \log_2 d - 2$, the probability that T has a leaf with depth at most h is at most $2^{1 - 2^{d-h-2}}$.

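Assume that $d(\ell) \ge 2d/3$ for all $\ell \in \mathcal{L}$. Then, using Theorem 3 and Lemma 5, we have

$$ \Pr_z\left[ s(T_z) < \mathbb{E}[s(T_z)] - \delta \right] \le e^{-2\delta^2 / (L \eta^2)} \le e^{-(9/8)\, 2^{d/3} \delta^2 / d^2}. $$

Also, if $d(\ell) \ge 2d/3$ for all $\ell \in \mathcal{L}$, by Lemma 4 $\mathbb{E}[s(T_z)] \ge d/3$. Thus, taking $\delta = \epsilon d/3$,

$$ \Pr_z\left[ s(T_z) < (1 - \epsilon) d/3 \right] \le e^{-2^{d/3} \epsilon^2 / 8}. $$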
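The constants in the two displayed bounds can be traced through explicitly. The derivation below is a sketch under two assumptions that the text leaves implicit: a tree of depth at most d has $L \le 2^d$ leaves, and $t \mapsto t\, 2^{1-t}$ is non-increasing for $t \ge 2$ (so $d \ge 3$ is assumed).

```latex
\begin{align*}
\eta &\le \max_{\ell \in \mathcal{L}} d(\ell)\, 2^{1 - d(\ell)}
      \le \frac{2d}{3}\, 2^{1 - 2d/3} = \frac{4d}{3}\, 2^{-2d/3}, \\
L \eta^2 &\le 2^{d} \cdot \frac{16 d^2}{9}\, 2^{-4d/3} = \frac{16 d^2}{9}\, 2^{-d/3}, \\
\frac{2\delta^2}{L \eta^2} &\ge \frac{9}{8} \cdot \frac{2^{d/3}\, \delta^2}{d^2},
\qquad\text{and with } \delta = \frac{\epsilon d}{3}: \quad
\frac{9}{8} \cdot \frac{2^{d/3}}{d^2} \cdot \frac{\epsilon^2 d^2}{9} = \frac{2^{d/3}\, \epsilon^2}{8}.
\end{align*}
```

The factor $d^2$ cancels once $\delta$ is chosen proportional to d, which is why the final exponent depends on d only through $2^{d/3}$.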

Now let T be picked uniformly at random from the set of decision trees of depth at most d. Taking
h = 2d/3 in Lemma 6 and using a union bound over the two bad events that $\min_{\ell \in \mathcal{L}} d(\ell) < 2d/3$
and $s(T_z) < (1 - \epsilon) d/3$, we obtain the overall bound that

$$ \Pr_T\left[ s(T) < (1 - \epsilon) d/3 \right] \le 2^{1 - 2^{d/3 - 2}} + e^{-2^{d/3} \epsilon^2 / 8}, $$

which is clearly of order $2^{-\Omega(2^{d/3})}$. By Theorem 2, $Q_2(T) \ge s(T)/18$. Taking $\epsilon$ to be an arbitrary
constant such that $0 < \epsilon < 1$ completes the proof of Theorem 1.

Observe that, if one is content with looser concentration bounds, it is possible to show that
$\Pr_T[s(T) < d/2 - O(\log d)] = 2^{-\Omega(d)}$, by taking $h = d - O(\log d)$ in Lemma 6.

2.1 Remaining lemmas



In order to prove Lemma 6 we will need a simple bound on the number of low depth binary trees.

Fact 7. Let $N_d$ be the number of binary trees of depth at most d. For $d \ge 1$, $N_d = N_{d-1}^2 + 1$, and
$N_0 = 1$. Thus $N_d \ge 2^{2^{d-1}}$ for all $d \ge 1$.

Proof. Every binary tree of depth at most d either consists of a single leaf or of two independent
binary trees of depth at most d - 1; the claimed recurrence for $N_d$ is immediate. The second part
easily follows from this recurrence by induction. □
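The recurrence in Fact 7 is easy to iterate, which gives a quick check of the doubly exponential lower bound for small d. The function name below is an illustrative choice.

```python
def count_binary_trees(d):
    """N_d: number of full binary trees of depth at most d, via N_d = N_{d-1}^2 + 1."""
    count = 1  # N_0: the tree consisting of a single leaf
    for _ in range(d):
        count = count * count + 1
    return count


first_values = [count_binary_trees(d) for d in range(5)]
bound_holds = all(count_binary_trees(d) >= 2 ** (2 ** (d - 1)) for d in range(1, 10))
```

The first few values are 1, 2, 5, 26 and 677; since each step squares the previous count, the number of bits roughly doubles with every unit of depth.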

Lemma 8. Let T be picked uniformly at random from the set of binary trees of depth at most d.
Then, for any $h \le d - \log_2 d - 2$, the probability that T has a leaf with depth at most h is at most
$2^{1 - 2^{d-h-2}}$.

Proof. Using a union bound and Fact 7, the probability that T has a leaf with depth at most h is
upper bounded by

$$ \sum_{k=0}^{h} 2^k \Pr[\text{a vertex at depth } k \text{ is a leaf}] = \sum_{k=0}^{h} \frac{2^k}{N_{d-k}} \le \sum_{k=0}^{h} 2^{k - 2^{d-k-1}} \le \sum_{k=0}^{h} 2^{-2^{d-k-2}} \le 2^{1 - 2^{d-h-2}}. $$

□
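The chain of inequalities in this proof can be checked with exact rational arithmetic for small d. The sketch below evaluates the union-bound sum $\sum_{k=0}^{h} 2^k / N_{d-k}$ and compares it with $2^{1-2^{d-h-2}}$ over every integer h allowed by the lemma; the function names and the range of d are assumptions of this example.

```python
from fractions import Fraction
from math import log2


def count_binary_trees(d):
    """N_d from Fact 7: N_0 = 1 and N_d = N_{d-1}^2 + 1."""
    count = 1
    for _ in range(d):
        count = count * count + 1
    return count


def union_bound(d, h):
    """Exact value of sum_{k=0}^{h} 2^k / N_{d-k}."""
    return sum(Fraction(2 ** k, count_binary_trees(d - k)) for k in range(h + 1))


def lemma8_bound(d, h):
    """Exact value of 2^{1 - 2^{d-h-2}}; requires d - h - 2 >= 0."""
    return Fraction(2, 2 ** (2 ** (d - h - 2)))


# All (d, h) pairs with integer h satisfying h <= d - log2(d) - 2.
pairs = [(d, h) for d in range(2, 13) for h in range(d + 1)
         if h <= d - log2(d) - 2]
all_hold = all(union_bound(d, h) <= lemma8_bound(d, h) for d, h in pairs)
```

Using `Fraction` avoids floating-point underflow, since both sides become doubly exponentially small as d - h grows.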



Picking a decision tree T uniformly at random from the set of decision trees of depth at most 
d is similar to, but not quite the same as, making T's structure a uniformly random binary tree 
of depth at most d, then assigning variables to the internal vertices at random, and values to the 
leaves at random. However, the above conclusions also apply to uniformly random decision trees 
(in other words, Lemma 8 implies Lemma 6). The reason is that choosing a tree from a distribution 
where trees with different variables assigned to the internal vertices are considered to be distinct 
can only decrease the probability that an arbitrary vertex at a given depth is a leaf. 



3 Conclusions 



We have shown that, in the query complexity model, most short decision trees do not admit a 
quantum speed-up greater than a constant factor. An interesting open question is to determine 
the extent of the quantum speed-up which is possible. It is known that, for all boolean functions 
$f : \{0,1\}^n \to \{0,1\}$, $Q_2(f) \le n/2 + O(\sqrt{n})$ [5]. Could it be the case that all decision trees of
depth d can be computed with bounded error using $d/2 + O(\sqrt{d})$ quantum queries? Indeed, one
can show that a non-zero speed-up is possible for essentially all decision trees: if f has a decision
tree of depth $d \ge 2$, then $Q_2(f) \le d - 1$. The reason is that every boolean function on 2 bits
can be computed with success probability 9/10 using only one quantum query [8], so the quantum 
algorithm can simply follow the classical decision tree, replacing the last two classical queries with 
one quantum query. 